Mining the World Wide Web
نویسندگان
چکیده
The World Wide Web has become, over the last years, a major source of information, and at the same time a significant platform for commerce. Both aspects make it an interesting target for data mining applications. In this survey, we will discuss different facets of data mining on the web, and illustrate its methods by typical application areas. These areas will be highlighted in more detail in the subsequent contributions to this special issue of the KI Journal on Web Mining. As internet based applications become more and more intertwined, we will equally consider related domains like email and newsgroups here. The contributions of the special issue indicate new trends in web mining research. Although not specifically requested in the call for papers, most of them focus on one of two issues: the detection of upcoming topics and trends, or the detection and support of online communities. We discuss in this paper that the emergence of these application domains goes together with two technical developments: the Semantic Web for explicitly representing knowledge in the web, and the Web 2.0 as an effort for facilitating user participation in the web. We will argue that the convergence of these two areas – one being an academic, top-down and the other a grass-roots, bottom-up approach – will be a major research challenge for the next years, where web mining will play a significant role.
منابع مشابه
A Technique for Improving Web Mining using Enhanced Genetic Algorithm
World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines are still able to be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...
متن کاملA Review Of Trends In Research On Web Mining
In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and other multimedia files available via internet and the number is still rising. But considering the impressive variety of the web, retrieving interesting content has become a very difficult task.So, the World Wide Web is a fertile area for data mining resear...
متن کاملWeb Mining: Knowledge Discovery on the Web
Web mining is the use of data mining techniques to automatically discover and extract information from web documents This paper summarizes the different types of web mining, and their current states of the art. Keywords—Web Mining, World Wide Web, Web Content Mining, Web Structure Mining, Web Usage Mining.
متن کاملWorking of the Web and Web Application for Web Mining
The Web is a popular and interactive medium to interchange information. Web mining shall have a greater significance with the increase of the applications on the internet. It uses various data mining techniques, but it is not an application of traditional data mining due to heterogeneity and unstructured nature of the data available on the World Wide Web. In this regard, the working of World Wi...
متن کاملComprehensive Survey of Framework for Web Personalization using Web Mining
World Wide Web is a global village and a rich source of information. The number of users accessing web sites is increasing day by day. For effective and efficient handling, web mining coupled with recommendation techniques provides personalized contents at the disposal of users. Web Mining is an area of Data Mining dealing with the extraction of interesting knowledge from the World Wide Web. Wh...
متن کاملPrioritize the ordering of URL queue in Focused crawler
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource discovery efficiently. For a crawler it is not an simple task to download the domain specific web pages. This unfocused approach often shows undesired results. Therefore, several new ideas have been proposed, among them a key technique is focused crawling which is able to crawl particular topical...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- KI
دوره 21 شماره
صفحات -
تاریخ انتشار 2007